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Abstract 

Emerging in December 2019, coronavirus disease 2019 (COVID-19) eventually 
became a pandemic and posed a tremendous threat to global public health. However, 
the origins of SARS-CoV-2, the causative agent of COVID-19, remain to be determined. 
It has been reported that a certain number of the early case clusters had a contact history 
with the Huanan Seafood Market. Therefore, surveillance of SARS-CoV-2 within the 
market is of vital importance. Herein, we presented the SARS-CoV-2 detection results 
of 1380 samples collected from the environment since 1st Jan and animals since 18th 
Jan within the market in early 2020. By SARS-CoV-2-specific RT-qPCR, 73 
environmental samples tested positive for SARS-CoV-2 and three live viruses were 
successfully isolated. The viruses from the market shared nucleotide identity of 99.99% 
to 100% with the human isolate HCoV-19/Wuhan/IVDC-HB-01/2019. The A lineage 
(8782T and 28144C), as the likely ancestral SARS-CoV-2 lineage, was found in an 
environmental sample. No virus was detected in the animal swabs covering 18 species 
of animals in the market. The RNA-seq analysis of SARS-CoV-2 positive/negative 
environmental samples showed the abundance of different vertebrata genera. In 
summary, this study provided convincing evidence of the prevalence of SARS-CoV-2 


in the Huanan Seafood Market during the early stage of COVID-19 outbreak. 


Keywords: 
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Infections with novel human coronavirus 2019 (HCoV-19) !2, named as severe acute 
respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on 
Taxonomy of Viruses (ICTV) 2, can result in coronavirus disease 2019 (COVID-19), 
named by World Health Organization (WHO) characterized by various clinical 
outcomes from asymptomatic infections to severe pneumonia and even death ^". 
Globally, as of Feb 28" 2023, over 758 million confirmed cases and over 6.8 million 


deaths have been reported (covid19.who. int). 


Early human cases with COVID-19 were first reported in late December 2019 from 
Wuhan, China, with pneumonia of unknown etiology (PUE), and a majority of them 
were found to be linked to the Huanan Seafood Market (HSM) in Wuhan *°, where 
various animal meats, exotic seafoods and live animals were available for purchase. 
Therefore, the HSM has been suspected to be the source of the COVID-19 pandemic ?. 
However, alternative hypothesis that some individuals outside the market had brought 
the virus into the market through human-to-human transmission or cold chain could not 
be ruled out, considering some of the early cases without epidemiological link to the 


t 68. Yet, more scientific evidence and further studies are indeed needed. 


marke 
Considering the findings that SARS-CoV-2 has high similarities with a few 
coronaviruses derived from bats in Asian countries including China, Laos, Japan, 
Cambodia and Thailand, scientists have proposed that bats might be the original source 
of SARS-CoV-2 1914. However, it is still a mystery whether another animal might act 
as an intermediate host to have facilitated the virus spillover from bats to humans "5:16, 
One of such important findings was the discovery of SARS-CoV-2 related 
coronaviruses from pangolins that harbored highly similar receptor binding domain 
(RBD) with that of SARS-CoV-2 in the viral spike (S) protein 1719, Despite that 
pangolins might be involved in the ecology of coronaviruses, whether the pangolins are 
the intermediate hosts is yet unknown, given the current data? A recent study 


documented the animal species in the HSM between May 2017 and November 2019 
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and noted that no pangolins or bats, but some hypothesized Sarbecovirus-susceptible 
animals such as Raccoon dogs were present ?!. Thus far, the origins of SARS-CoV-2 
2225 and the role of the HSM in the origins and spread of SARS-CoV-2 remain unclear. 


The data from the HSM may provide important information in this field. 


The HSM is located in the Jianghan District, a downtown of Wuhan, the capital city of 
Hubei Province, and is approximately 800 m far from Hankou Railway Station, a major 
railway travel hub. It occupies >50,000 m?, with 678 stalls located close to each other 
in an extremely crowded condition (Fig. 1A). The market is separated into two zones, 
East and West Zones, with seafood and animals mainly sold in West Zone and livestock 
meat in East Zone. Among the 678 stalls of the market, 10 domesticated wildlife 
animal-selling stalls (1.596) were identified, located in the south-western corner of West 
Zone (8/10) and the north-western corner of East Zone (2/10), respectively (Fig. 1A). 
According to sale records, during late December 2019, animals or animal products were 
sold in these 10 animal stalls. Animals included snakes, avian species (chickens, ducks, 
gooses, pheasants and doves), Sika deer, badgers, rabbits, bamboo rats, porcupines, 
hedgehogs, salamanders, giant salamanders, bay crocodiles and Siamese crocodiles, 
etc., among which snakes, salamanders and crocodiles were traded as live animals 
(described in detail in the Report of WHO-convened global study of origins of SARS- 
CoV-2 24), 


The market was closed in the morning of January 1*, 2020, shortly after the 
identification of the PUE. At the same time, in order to investigate the potential 
introduction of SARS-CoV-2 into the market, Chinese Center for Disease Control and 
Prevention (China CDC) dispatched an epidemiological team, together with experts 
from Hubei Provincial CDC and Wuhan Municipal CDC, to the HSM to collect 
environmental samples in the early morning of January 1°, 2020 (Fig. 1B). From 
January 1%, 2020, to March 2", 2020, a total of 923 environmental samples from 


different locations within and around the market and 457 animal samples including 


116 animal bodies, stray animals and their feces, were collected, with some stray animals 
117 sampled until March 30" (Extended Data Tables 1, 2, 3 and Supplementary Table 1). 
118 This may reflect the profile of SARS-CoV-2 contamination in the market during the 
119 early phase of the outbreak. After the closure of the market, the outside surface of the 
120 rolling shutter doors of the stalls and the corridors were disinfected (with 1% bleach 
121 mixed with water) throughout January and February 2020. The goods inside the stalls 
122 were completely cleared and disinfected until early March 2020. 

123 

124 Out of the 923 environmental samples collected in and around the HSM, 73 were found 
125 to be positive for SARS-CoV-2 with positive rate of 7.9% through the nucleic acid test 
126 (МАТ), with cycle threshold (CT) values of real-time polymerase chain reaction (PCR) 
127 ranging from 23.9 to 41.7 (Table 1). Among the 828 samples inside the HSM, 64 
128 samples (7.7%) were positive. For the 14 samples from warehouses related to the HSM, 
129 five tested positive. Among the 51 samples from sewerage wells (Supplementary Table 
130 1) in the surrounding areas outside the HSM, three were tested positive (Table 1). 
131 Notably, one sample (Env 0601) out of the 30 environmental samples, a floor surface 
132  swab collected from Dongxihu Market in Wuhan on January 22nd, 2020, was also 
133 tested positive (Table 1, Extended Data Table 4). 

134 

135 For the 64 SARS-CoV-2 positive samples inside the HSM, 87.5% (56/64) were 
136 collected in the West Zone of the market, in particular streets from no. 1 to 8, with 71.4% 
137 (40/56) positive samples identified herein (Fig. 1A). Of the 110 samples collected from 
138 sewers or sewerage wells in the market, 24 samples were positive for SARS-CoV-2 
139 nucleic acid. All the four sewerage wells in the market tested positive. During the onsite 
140 investigation of the overground drainage pathway in the HSM, we found that the 
141 wastewater in the overground drainage was led into the underground drainage inside 
142 the market and then flow into the wells on the edge of the market. We then did a spot- 
143 check sampling across all the overground drainages according to the principles 


144 described in the Methods (Extended Data Fig. 1). In fact, the excreta of upper 
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respiratory tract of the patients and the potential animal waste would be mixed together 
into the overground drainage. Thus, these data suggested that either the infected people 
and/or animals in the market contaminated the sewage or that the contaminated sewage 
might have further played a role in the virus transmission within the case cluster in the 


market. 


The merchants’ activities were assessed against the NAT results of the environmental 
samples. The sampling covered 19.8% (134/678) of the vendors in the market (95% 
confidence interval (CI): 16.8-23.0%). Of the positive samples, 44 were distributed 
among 21 vendors in the market, 19 of whom were located in the West Zone and the 
remaining two located in the east area (Fig. 1A). Some vendors sold more than one 
product type. While the results provided some indication of association of cases with 
different products, no significant differences were observed between different vendors, 
including poultry (22%, 8/37: 95% CI: 9.8-38.2%), cold-chain products (18.4%, 16/87, 
95% CI: 10.9-28.1%), aquatic products (17.8%, 13/73, 95% CI: 9.8-28.5%), livestock 
(14%, 5/36: 95% CI: 4.7-29.5%), seafood products (11%, 6/56: 95% CI: 4-21.9%), 
wildlife products (11%, 1/9: 95% CI: 0.3-48.2%), and vegetables (25%, 2/8: 95% CI: 
3.2-65%) (Extended Data Fig. 2, Extended Data Table 5). The positive vendors with 
multiple product types suggested that SARS-CoV-2 might have been circulating in the 
market, especially the West Zone, for a while in December 2019, leading to an extensive 
distribution of the virus within the market, which might have been facilitated by the 


crowded buyers and the contaminated environment. 


The 457 animal samples included 188 individuals belonging to 18 species (with some 
stray animals sampled until March 30") (Table 2). The sources of the samples included 
unsold goods kept in refrigerators and freezers in the stalls of the HSM, and goods kept 
in warehouses and refrigerators related to the HSM. Three Chinese giant salamanders, 
which were found in a fish tank, were alive and swab samples were collected and tested. 


Samples from stray animals in the market were also collected, i.e. swab samples from 
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10 stray cats, 27 cat feces, one dog, one weasel, and 10 rats. All the 457 animal samples 


were tested negative for SARS-CoV-2 nucleic acid. 


To determine whether there was live virus in the HSM, we inoculated 27 SARS-CoV- 
2 positive environmental samples collected on January 1*, 2020, into cell lines, 
including Vero E6 and Huh7.5 cells. Cytopathic effects (CPE) were observed 3 days 
post inoculation with sample Env_0313 on Vero E6 cells. CPE was also observed 5 
days post inoculation on Huh7.5 cells. The electron micrographs of Vero E6 cells after 
5 days of post inoculation showed that virus particles were present in both the 
supernatant and the cells. Negative-stained virus particles and ultra-thin cultured cell 
sections showed typical coronavirus morphology (Fig. 2). Totally, live viruses were 
isolated from samples Env_0313, Env_0354 and Env_0126, which were the only three 
samples with CT values <30 in the NAT. Env_0354 and Env_0126 were swab samples 
of the ground and Env 0313 were swab samples of the wall. Notably, samples 
Env_0313 and Env_0354 were from the stalls with confirmed patients. All the results 
of successful virus isolation and the CT values of the original samples revealed the 
existence of live SARS-CoV-2 with high titers in the environment of the HSM. We did 
not perform virus isolation based on the samples collected from later time points due to 


the high CT values. 


During later sampling in the HSM on February, we specially collected some samples to 
investigate the virus RNA persistence in the Market. Some of the samples were still 
tested positive, especially in the sewage well and even on the walls (Table 1). For the 
sample Env 0838 on the wall collected on February 20", 2020, the results of the three 
PCR targets were 32.59/‘-”/37.34, accordingly. This result is reasonable considering 
the degradation of the viral genome and some of the PCR target cannot be detected. 
However, the results also indicate a long persistence of the virus RNA in the 


environment. 
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We further performed high-throughput sequencing (Extended Data Table 7) and 
successfully obtained seven complete or near complete SARS-CoV-2 genome 
sequences, including three sequences from three environmental samples (Env 0313, 
Env 0354 and Env 0020), and four sequences from cell supernatants of Env 0313, 
Env 0354 and Env 0126 (Fig. 3, Extended Data Table 7). The genome sequences of 
three environmental samples, Env 0126, Env 0313 and Env 0354, were found to be 
completely identical to the reference strain HCoV-19/Wuhan/IVDC-HB-01/2019 
(IVDC-HB-01, GISAID accession number: EPI ISL 402119) and the human strain 
Wuhan-Hu-1 (GenBank: NC 045512) (Fig. 3A). The genome sequence of the isolated 
virus from environmental sample Env 0354 had two synonymous mutations compared 
to HCoV-19/Wuhan/IVDC-HB-01/2019, with sequence identity of 99.99% (Fig. 3A). 
Therefore, the SARS-CoV-2 sequences from environmental samples were highly 


similar to the clinical strains obtained during the early stage of the COVID-19 outbreak. 


Previously, SARS-CoV-2 has been proposed to be classified into two major lineages 
based on the two highly-linked single nucleotide polymorphisms (SNPs): A lineage 
(8782T and 28144C, or S lineage in another nomenclature of SARS-CoV-2) and B 
lineage (8782C and 28144T, or L lineage). It has been proposed that A/S lineage most 
likely is the ancestral lineage, because all of the SARS-CoV-2 related coronaviruses 


С ?926 while Pekar et al. also 


from bats and pangolins possessed 8782T and 28144 
presented a possibility that both lineages represent separate introduction events”. 
Phylogenetic analysis revealed that most of the environmental strains belong to the B/L 
lineage and they cluster together with the human strains circulating in the early stage 
of the pandemic (Fig. 3B, Supplementary Fig. 1). However, the environmental sample 
Env_0020 falls within the A/S lineage in the tree (Fig. 3B), with the confirmation by 
the high number of reads mapped to positions 8782 and 28144 in Env_0020 (Extended 
Data Table 8). However, it should be noted that the genome of Env_0020 is of low 


quality and there are many discontinuous gaps in the assembled genome. Indeed, 


though it is difficult to root the SARS-CoV-2 phylogenetic tree, our analysis indicated 
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that the environmental viruses cluster together with the human strains circulating in the 


early stage of the pandemic. 


We conducted RNA-seq analysis using 60 SARS-CoV-2 PCR-positive and 112 SARS- 
CoV-2 PCR-negative environmental samples from the HSM (Fig. 4A and Extended 
Data Table 6). Bacteria were the most abundant species in almost all samples and 
mammal species could be found in most samples, which fit the feature of samples 
collected from the environment (Fig. 4B and Supplementary Table 2). We further 
classified the vertebrate genera using the barcode of life data system. The Gallus, Homo, 
Anas, Sus, Bos, and Canis could be detected in most samples (Fig. 4C and 
Supplementary Table 3), which was in accordance with the environmental feature of 
the seafood markets in China. In addition, The most different genus in the bacteria 
domain belonged to Flavobacteriia class (higher in SARS-CoV-2 PCR-positive 
samples) and Clostridia class (higher in SARS-CoV-2 PCR-negative samples) 
(Supplementary Table 4). We analyzed the mammal genera in all sequenced samples 
with the kranken2 (detailed in the method) using different thresholds (Fig. 4D and 
Supplementary Fig. 3). A total of 70 mammal genera, which existed in more than 2% 
samples, were identified with a threshold of 100 reads per million (Fig. 4D and 


Supplementary Table 5). 


Particularly, we analyzed three samples (the B5, F13, and F54) collected on 1“ Jan 2020 
with high levels of SARS-CoV-2 (Ct value <30) (Fig. 4E). The identified mammal 
genera in the F13 and F54 were related to species in the general food market, such as 
Homo, Ovis, Bos, Canis, Sus, and Felis. Many mammal genera were observed in the 
B5 sample, but the most abundant mammal genera were also related to the general food 
market, including Bos (77.30%), Ovis (19.91%), Homo (0.77%), and Bubalus (0.57%). 
Although Pipistrellus, a kind of bat, can also be found in this sample, but the relative 
ratio was low (0.002%), and the Pipistrellus is unlikely the kind of bat that host SARS- 


CoV-2 as previously reported. Meanwhile, the Lutra, another suspected host for SARS- 
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CoV-2, was also observed in this sample, but the relative ratio was extremely low 
(0.001%). It should be noted that mammals share some sequences in genomes. Thus, 
the more mammal reads detected, the more species will be found, and those identified 
species/genera with relatively low supporting reads were likely unreliable in those 
samples with abundant detected mammal reads. Moreover, we also noted that only 
Homo, Ovis, Bos, and Sus reads but not species related to wildlife were found in the 


Env_0020 samples, the one that falls within the A/S lineage in the tree. 


In addition, we illustrated the top-ranked genera in four areas of the market, where 
multiple SARS-CoV-2 PCR-positive samples were detected. As shown in Fig. 4F, the 
top-ranked genera in these areas were homo or other genera generally existed in food 
markets. We also noted the Nyctereutes could be found in the vender 25 of the street 8, 
while Atelerix and Erinaceus could also be found in vender 15-17 of street 7 (Fig. 4F). 
However, these genera existed in both SARS-CoV-2 PCR positive and negative 
samples (Supplementary Table 2, 3), not to mention the positive ratio was much higher 


in SARS-CoV-2 PCR negative samples. 


Also, we checked samples that might relate to wildlife, such as samples collected in the 
defeathering machine and the visible blood spot. Top abundant mammal genera of the 
defeathering machine sample (Env_0584) was Canis (Extended data Fig 3). Top 
abundant mammal species of the visible blood spot (Env_0262) were Bos, Sus, Ovis 
and Bison, accordingly (Extended data Fig 3). Additionally, we plotted the distribution 
of some genera of concern, including Myotis, Erinaceus, Mustela, Nyctereutes, 
Rhizomys, Meles, and Melogale. Most of these samples were distributed in the western 
district of the market (Extended data Fig 4), where they sold wildlife products. 
Although the enriched areas of SARS-CoV-2 PCR-positive samples were nearby, the 
distribution locations of Homo, Sus, Bos, Gallus and Anas were also dominated in this 
area. The repeated sampling of the locations with PCR-positive results may contribute 


some bias on the distribution analyses of enriched areas of SARS-CoV-2 PCR-positive 
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samples. Moreover, we plotted the distribution of mammal genera in those SARS-CoV- 
2 positive samples with high abundance of genera related to wildlife, such as the 
Env 0576 (Nyctereutes enriched), Env 0807 (Lariscus enriched), Env 0809 
(Erinaceus enriched), and Env. 0585 (Erinaceus enriched) (Extended Data Fig. 3). We 
noted multiple wildlife-related genera could be identified, and the relative ratio was low 
when using the kraken2 methods, except for the Atelerix and Erinaceus, genera that 


have been reported in the WHO report. 


Of particular note was the difference in the results from PCR and NGS. In fact, among 
the 60 SARS-CoV-2 PCR-positive samples for RNA-seq analysis, 39 samples tested 
negative by NGS (no SARS-CoV-2 reads at all) (65.096), including the samples 
Env 0262. For these NGS-negative samples, the CT values ranged from 31.80 to 37.44. 
Since the RT-PCR detection assay employed in the very early stage of the pandemic 
was not formally verified, we believe that there might be some false positives in the 
PCR detection results in this study. Meanwhile, we also found the SARS-CoV-2 reads 
could also be detected in a portion of SARS-CoV-2 PCR negative samples (15.2%), 
which might be caused by the degradation of SARS-CoV-2 within the PCR target region. 


In summary, SARS-CoV-2 RNA was detected in stalls in the West Zone of the HSM, 
suggesting the prevalence of SARS-CoV-2 in the market. Thus, although the origin of 
the virus cannot be determined from all the analyses available so far, the market might 
at least have acted as an amplifier due to the high number of visitors every day, causing 
many initially identified infection clusters in the early stage of the outbreak ”. In 
addition, live SARS-CoV-2 viruses also existed in the environmental samples. More 
importantly, this is the first time for SARS-CoV-2 virus isolation from environmental 
samples. No SARS-CoV-2 was detected in the animal samples from the HSM. However, 
it should also be noted that samples analyzed in the current study were somehow biased, 
and wildlife-related vendors and early case-related vendors were prioritized for sample 


collection. Actually, the gene barcode analysis of animal species in the study showed 
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Myotis, Nyctereutes and Melogale that have been recognized as potential host species 
of Sarbecoviruses were mostly detected within the SARS-CoV-2 PCR negative 


environment samples. 


Meanwhile, recent reports traced the outbreak back to the HSM and suspected that the 
market sold live animals as recently as 2019, by compiling information reported by 
various sources, including the WHO-China Joint Report and social media, etc?*. 
Another report hypothesized that SARS-CoV-2 was spilled over from animals to 
humans at least twice in November or December 2019, even the raccoon dog was 
suspected as the intermediate animal”. Nevertheless, the new evidence pinpointing to 
the market does not support such a hypothesis and an alternative hypothesis could be 
further studied ??. Our study confirmed the existence of raccoon dog and other 
hypothesized/potential SARS-CoV-2 susceptible animals through gene barcode within 
the market before the closure. However, these environmental samples cannot prove the 
infection of the animals. Furthermore, even if the animals were infected, it could not 
still rule out that the human-to-animal transmission occurred, considering the sampling 
time was at least one month after the human-to-human transmission within the market. 
Thus, the possibility of potential introduction of the virus through human or cold chain 


product into the market cannot yet be ruled out. 


Definitely, more work involving international coordination is needed to investigate the 
potential origins of SARS-CoV-2 4. Surveillance of wild animals using a viromic 
approach should be enhanced to explore the potential natural and intermediate hosts for 
SARS-CoV-2 759 if any, which would help to prevent future pandemics caused by 


animal-origin coronaviruses or alike, with a spillover event. 
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Figure legends 

Fig. 1. The distribution of the positive environmental samples in the Huanan 
Seafood Market. 

A. As the place of the early cluster of COVID-19 patients, the Huanan Seafood Market 
is separated into East and West Zones with the Xinhua Road between them. To detect 
for the presence of SARS-CoV-2 RNA, reverse transcription, quantitative polymerase 
chain reaction (RT-qPCR) was performed. The locations of the positive samples were 
marked in the map of the market within orange, while the location of the samples that 
the live viruses were isolated from were labeled with red. The map also shows locations 
of stalls where domesticated wildlife products were sold. B. Timeline of environmental 
and animal samples collected within and around the Huanan Seafood Market. The 
information of confirmed patients up to December 31st 2019 was referenced from the 


Report of WHO-convened global study of origins of SARS-CoV-2. 


Fig. 2. The SARS-CoV-2 virus isolation from environmental samples of the 
Huanan Seafood Market. 

The electron micrographs of the SARS-CoV-2 viruses isolated from the environmental 
samples in the Huanan Seafood Market. To determine whether SARS-CoV-2 particles 
could be visualized from the cell supernatant and lysate, we used transmission electron 
microscopy (EM) to observe the culture supernatant and ultra-thin section cells based 
from both VeroE6 and Huh7.5 cells. The electron micrographs showed that virus 
particles were present in both the supernatant (A, B) and the cells (C, D). Negative- 
stained virus particles were generally spherical, pleomorphic and 60-140 nm in 
diameter. Spike protrusions were observed around the particles in a crown (corona) 
shape (A, B). In ultra-thin cultured cell sections, a group of virus particles can be seen 


outside the cell (C), and sheets of virus particles can also be observed inside the cells 


(D). 


Fig. 3. Genomic and phylogenetic analyses of SARS-CoV-2 virus genomes from 
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the Huanan Seafood Market. 

A. Sequence comparison of the full-length SARS-CoV-2 genomes in the environmental 
samples. B. Phylogenetic analysis of full-length SARS-CoV-2 genomes from the 
Huanan Seafood Market and representative strains from the early stage of the COVID- 
19 pandemic, showing that most environmental strains cluster together with the human 
strains in the B/L lineage, with Env_0020 in A/S lineage. 

Fig. 4. Analysis of environmental samples in the Huanan Seafood Market. 

A. Schematic illustration of the experimental design. All 73 SARS-CoV-2 positive 
samples were included for RNA-seq. A total of 60 RNA-seq libraries were successfully 
constructed. Additionally, RNA-seq libraries of 112 SARS-CoV-2 negative samples 
passed library quality control. The kranken2 was used for genus classification. The 
bowtie2 and sequences in the barcode of life data system was used for the classification 
of genus in the Mammalia class. B. Heatmap showing the reads distribution of the four 
domains (Bacteria, Eukaryota, Viruses and Archaea), the Homo genus, the Mammalia 
class and the SARS-CoV-2 species. SARS-CoV-2 PCR-positive or -negative were 
shown in the left panel. C. Positive ratio of illustrated genus in all tested samples. Top 
ranked genus within the Mammalia class were shown. D. Illustration of mammal genera 
in market using the threshold of 100 reads per millions. The samples were group by 
SARS-CoV-2 PCR results. The blue bar indicates the positive detected genera. E. 
Illustration of mammal genera distribution in samples with high viral load. The 
Env_0020, Env_0313, Env_0354 and Env_0126 were shown. 

F. Distribution of the positively detected Mammal genera in the market. Samples in four 
areas where multiple SARS-CoV-2 PCR-positive samples were plotted. The 


distribution of top mammal genera in each area was shown. 
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527 Tables 


528 Table 1. Information of the positive environmental samples from the Huanan Seafood 


529 Market’. 


Sample 
ID 

Env 0275 
Env 0101 


Env 0014 


Env 0015 
Env 0018 
Env 0002 
Env 0020 
Env 0033 
Env 0055 
Env 0061 
Env 0063 
Env 0087 
Env 0088 
Env 0090 
Env 0096 
Env 0138 
Env 0126 
Env 0213 


Env 0262 


Env 0221 


Env 0400 
Env 0313 
Env 0333 
Env 0346 
Env 0354 
Env 0398 
Env 0509 


Env 0552 


Env 0576 
Env 0579 
Env 0583 


Env 0584 


Env 0585 


Env 0660 
Env 0664 
Env 0682 
Env 0686 
Env 0717 
Env 0719 
Env 0742 
Env 0743 
Env 0744 
Env 0745 
Env 0746 
Env 0747 
Env 0748 
Env 0750 
Env 0752 
Env 0762 
Env 0828 
Env 0829 


Lab code 


E61 
A101 


А14 


А15 
A18 
A2 
А20' 
АЗЗ 
А55 
A61 
A63 
A87 
A88 
A90 
A96 
B17 
В5» f 
D32 


E48 


E7 


F100 
БІЗ»! 
F33 
F46 
F545f 
F98 
G93 


Q37 


Q61 
Q64 
Q68 


Q69 


Q70 


1-27-33 
1-27-37 
1-29-4 
1-29-8 
8-25-D 
8-25-MI 
WS-1-1 
WS-1-2 
WS-1-3 
WS-1-4 
WS-2-1 
WS-2-2 
WS-2-3 
WS-3-2 
WS-4-2 
zong-1 
w-6-29-33 
w-zong-1 


Sample type 


Ground 

Surface of the door 
Surface of packing bag 
for hairtail 

Surface of the door 
Shoe covers and soles 
Ground 

Gloves 

Garbage truck 

Ground 

Ground 

Ground 

Surface of the door 
Ground 

Ground 

Ground 

Scale 

Ground 

Surface of a cart 

Blood on the ground in 
front of the door 
Styrofoam desk in front 
of the door 

Ground 

Surface of the wall 
Roller shutter 

Ground 

Ground 

Ground 

Sewage 

Inner surface of the 
freezer 

Cartl 

Cart2 

Ground 

Feather removal 
machine 

Iron container in inner 
room 

Water drain 

Water drain 

Water drain 

Water drain 

Ground inside the stalls 
Ground inside the stalls 
West sewage well 1 
West sewage well 2 
West sewage well 3 
West sewage well 4 
West sewage well 1 
West sewage well 2 
West sewage well 2 
West sewage well 3 
West sewage well 4 
Water drain 

Water drain 

Water drain 


Sampling date 


2020/1/1 
2020/1/1 


2020/1/1 


2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 


2020/1/1 


2020/1/1 


2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 
2020/1/1 


2020/1/12 


2020/1/12 
2020/1/12 
2020/1/12 


2020/1/12 


2020/1/12 


2020/1/27 
2020/1/27 
2020/1/29 
2020/1/29 
2020/2/3 
2020/2/3 
2020/2/5 
2020/2/5 
2020/2/5 
2020/2/5 
2020/2/5 
2020/2/5 
2020/2/5 
2020/2/5 
2020/2/5 
2020/2/9 
2020/2/15 
2020/2/15 


19 


Street 


No. 


ene мА ммм ммм BBD 


© 
чл 


I AAN бо WDBRNNNKY SF N 


—O о 
o 


DE ZZ ZZZ ZZZ ZO UM 


чл 


— 


>> р> р р> р р р р 


Vendor 
No. 

1-3 

19 


26 


26 
15-17 
15-17 
15-17 
NA 
25 
20-22-24 
16-18 
28 

28 

26 

24 
X44 
6-8 


X6-X4 
15 

17 

24 

14 
X6-X4 
19-21-23 


25 


29-31-33 
29-31-33 
29-31-33 


29-31-33 


29-31-33 
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`Р 
(95) 
чө 


Z 
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PCR 


+++++++++++ +++ ++ +++ 


++++++ + + 


t+ ++ 00 


+++++ +++ +++ ++ + + ++ 


35.9 

36.5 

36.00 
38.00 
34.01 
38.00 
37.30 
37.65 
36.72 
37.60 
37.10 
34.94 
37.97 
36.54 


PCR target 


ORFlab/N 
ORFlab/N 


ORFlab/N 


ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 


ORFlab/N 


ORFlab/N 


ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 
ORFlab/N 


ORFlab/N 


ORFlab/N 
ORFlab/N 
ORFlab/N 


ORFlab/N 


ORFlab/N 


ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 


NGS * 


Yes 
Yes 


Yes 


Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Yes 


Yes 


Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Yes 


Yes 
Yes 
Yes 


Yes 


Yes 


No 
No 
No 
Yes 
Yes 
Yes 
Yes 
No 
No 
No 
Yes 
No 
Yes 
No 
Yes 
No 
Yes 
Yes 


Env 0813 
Env 0815 
Env 0816 
Env 0817 
Env 0818 
Env 0820 
Env 0821 
Env 0830 
Env 0806 
Env 0807 
Env 0808 
Env 0809 


Env 0838 


Env 0862 


Env 0863 


Env 0865 


Env 0867 


Env 0868 


Env 0601 


Env 0790 
Env 0797 


Env 0802 


530 Four positive samples were also included, that were not collected from the Huanan Seafood Market. 
531 Опе sample was a floor surface swab collected from Dongxihu Market in Wuhan on January 22th, 
532 2020. and 3 collected from sewerage wells in the surrounding areas collected on February 9", 2020. 


EWS-2#-2 
EWS-3#-2 
WWS-1l# 
WWS-1#-2 
WWS-1#-3 
WWS-2#-2 
WWS-2#-3 
wws- 1#-0 
W-8-25-D1 
W-8-25-D2 
W-8-25-L 
W-8-25-L2 


C8 


SJ-D 


SJ-CS 


SJ-L3 


RLC-4 


RLC-3 


WH-17 


CSSQ-1-3 
YCHC2-1-1 


HXJC4-1-3 


East sewage well 2 
West sewage well 3 
West sewage well 1 
West sewage well 1 
West sewage well 1 
West sewage well 2 
West sewage well 2 
West sewage well 1 
Ground inside the stalls 
Ground inside the stalls 
Container 

Container 


Wall inside the stalls 


Storehouse ground 


Storehouse weight scale 


Storehouse wire fence 


Storehouse bag surfaces 


Storehouse bag surfaces 


floor surface swab 
collected from 
Dongxihu Market 
sewerage well in 
surrounding area 
sewerage well in 
surrounding area 
sewerage well in 
surrounding area 


2020/2/15 
2020/2/15 
2020/2/15 
2020/2/15 
2020/2/15 
2020/2/15 
2020/2/15 
2020/2/15 
2020/2/15 
2020/2/15 
2020/2/15 
2020/2/15 


2020/2/20 


2020/2/22 


2020/2/22 


2020/2/22 


2020/2/22 


2020/2/22 


2020/1/22 


2020/2/9 


2020/2/9 


2020/2/9 


NA 


533 ° Тһе live viruses were isolated from the three samples. 


534 © “Yes”, Samples with NGS analysis; “No”, Samples without NGS analysis due to the limited RNA 
535 sample amount. 

536 ‘The PCR results of Env 0552, Env 0576 and Env 0585 were negative. There were no CT values 
537 for these samples. The PCR results of Env 0579, Env 0583 and Env 0584 were positive, but the 
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stairl-2 


25 


25 


25 


25 


25 


МА 


МА 


МА 


МА 


538 CT values for these samples were not recorded from the laboratory. 


539 °Тһе“-” indicates the negative result for the N gene target. 


540 f The SARS-CoV-2 genomes were sequenced from the four environmental samples or their cell 


541 supernatants after virus isolation. 
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+ +++ +++ + + 


+ 


+ 


+ 


40.21/4 
0.1/36.3 
7 

41.77/4 
1.62/37. 


36.05 


ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 
ORFlab 


RdRp/N/E 


RdRp/N/E 


RdRp/N/E 


RdRp/N/E 


RdRp/N/E 


RdRp/N/E 


ORF lab 


ORF lab 
ORF lab 


ORF lab 


Yes 


Yes 


No 


Yes 


No 


Yes 


542 Table 2. The animal samples collected in the Huanan Seafood Market. 


Species Animal Sample RT-PCR positive 
number number number 

Rabbit/Hares 52 104 0 
Stray cat 27 80? 0 
Snake 40 80 0 
Hedgehog 16 67 0 
Muntjac 6 18 0 
Dog 7° 17 0 
Badger 6 16 0 
Bamboo rat 6 15 0 
Rat" 10 12 0 
Pig NAS 64 0 
Chicken 5 5 0 
Chinese giant salamander 3 5 0 
Crocodile 2 4 0 
Wild boar 2 4 0 
Soft-shelled turtle 2 3 0 
Weasel* 1 2 0 
Fish 2 2 0 
Sheep 1 1 0 
Others NAf 16 0 
Total 188 457 0 


543 * Six of the cats were from the Huanan Seafood Market. And the samples included 
544 faeces. 

545 "Including one stray dog in the Huanan Seafood Market. 

546 ° Not applicable due to the processed pork. 

547 ‘Collected from other markets. 

548 “The weasel was not sold in the market, but caught alive in the Market. 

549  fNotapplicable due to the unrecognized “bai Пао” product as described in the Extended 
550 data methods. 
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563 


564 


565 


566 


567 


568 


569 


570 


571 


572 


573 


574 


575 


576 


577 


578 


579 


Extended data methods 
Sample collection 

The Huanan Seafood Market (HSM) was closed in the early morning of January 1st 
2020, and at the same time, China CDC began collecting environmental and animal 
samples. Staff from China CDC entered the market about 30 times before the market’s 
final clean-up on March 2™ 2020, with some stray animals sampled outside the market 
until March 30". Environmental samples in the HSM were collected to represent 
exhaustively as possible, from a wide diversity of surfaces, animals and products (Table 
I and 2) according to different sampling principles, as described in detail in the Joint 
Report of WHO-convened Global Study of Origins of SARS-CoV-2: China Part !. 

The principles and ranges of in-market sampling covered: (1) environmental samples 
from stalls related to early cases; (2) environmental samples from doors and floors of 
all stalls in the blocks where the early cases were located; (3) environmental samples 
in the East Zone of the market were collected according to blocks; (4) transport carts, 
trash cans and similar objects; (5) environmental samples from stalls that sold livestock, 
poultry, farmed wildlife (also called “domesticated wildlife” or “domesticated wildlife 
products” in this report); (6) samples of sewage and silt from drainage channels and 
sewerage wells; (7) stray cats, rats and other stray animals in the market; (8) animal 
products and other commodity samples kept in the cold storages and refrigerators in the 
market; (9) the market’s ventilation and air-conditioning system; and (10) public toilets, 
public activity rooms and other places where people gathered in the market. 

The investigators used full personal protective equipment during the sampling in the 
market. Commercial products of swabs and virus preservation solution were used for 
the sampling (Disposable Virus Sampling Tube, V5-S-25, Shen Zhen Zi Jian 
Biotechnology Co., Ltd., Shenzhen, China). For environmental samples, sampling 
swabs were applied to smear the floors, walls or surfaces of objects and then preserved 
them in virus preservation solution. 

For animal samples, depending on the type of animal and whether it was alive or 


frozen, pharyngeal, anal, body surface and body cavity swabs or tissue samples were 
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collected for nucleic acid testing (NAT). Generally, for alive animal and frozen full 
bodies, three samples, including pharyngeal, anal and body surface swabs were 
collected for each animal individuals. And for animal bodies after “bai tiao” disposing 
(remaining parts of poultry or livestock after removal of hair and viscera), the body 
cavity swabs were collected. 

Drain samples were collected by the use of virus sampling swabs to probe into the 
silt at the bottom of drainage channels in the market. Wastewater and silt samples were 
preserved in virus preservation solution. For the sewage well (for the drain water), a 
container was used to take a silt-water mixture from a location near the bottom of the 
well, and an appropriate amount of sample was collected by using virus sampling swabs 
and then preserved in virus preservation solution. 

Nucleic acid extraction and SARS-CoV-2 real-time PCR assay 

A virus nucleic acid extraction kit (Xi'an Tianlong) was used to extract viral nucleic 
acid from samples using an automated nucleic acid extraction instrument according to 
the manufacturer’s instructions. Real-time (RT) PCR was performed on extracted 
nucleic acid samples with a SARS-CoV-2 nucleic acid assay kit. The reagent brands 
include BioGerm (40/38, cycle number/cut-off value, the same as below), DAAN 
(45/40) and BGI (40/38). 

Virus isolations 

Virus isolations were performed in biosafety level (BSL)-3 laboratory in National 
Institute for Viral Diseases Control and Prevention, China CDC. Samples positive for 
SARS-CoV-2 were cultured in Vero E6 and Huh7.5 cells on January 11", 2020. The 
cell lines were inoculated with positive samples and three blind passages were 
performed for each sample. The culture supernatant and cell pellet of each passage were 
harvested for RT PCR. The morphology of viral particles in the cell sections and the 
supernatant were firstly observed by transmission electron microscope (TEM) on 
January 22™, 2020. 

Metagenomic sequencing 


Metagenomic sequencing was conducted at National Institute for Viral Disease Control 
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and Prevention, China CDC and Wuhan BGI. Nucleic acid was extracted using Qiagen's 
viral RNA microextraction kit and human nucleic acid was removed using an 
enrichment kit to improve the sensitivity of viral RNA detection. Extracted RNA was 
reverse transcribed into cDNA and segmented into 150-200 bp by enzyme digestion. 
After repair, fitting, purification, PCR amplification and purification, sample 
concentration was assayed by DNBSEQ-T7, and an average output of more than 200 
million reads was obtained. Sequencing data were compared with those in a SARS- 
CoV-2 database to determine whether the samples contained coronavirus sequences. 
For the seven complete SARS-CoV-2 genome sequences, three sequences from 
environmental samples (Env 0020 seq01, Env 0313 seq02 and Env 0354 seq03) 
were obtained from DNBSEQ-T7, and four sequences from cell supernatants of 
Env 0313, Env 0354 and Env. 0126 (Fig. 3) were obtained from NextSeq 550 platform. 
A few samples were re-sequenced using a multiplex PCR approach, including 
Env 0020 seq01, Env 0313 seq04, Env 0313 seq05, Env 0126 seq06, and 
Env 0354 seq07 (Extended Data Table 6 and 7). All raw data related to the genomes, 
including any partial genomes that were sequenced were fully reported and deposited 
to the public database (Extended Data Table 6 and 7). 

Virus genome assembly and phylogenetic analysis 

Raw reads were adaptor- and quality- trimmed with the Fastp (version 0.20.0) program. 
The clean reads were mapped to the SARS-CoV-2 reference genome (GenBank: 
NC 045512) using Bowtie2. The assembled genomes were merged and checked using 
Geneious (version 11.1.5) (https://www.geneious.com). The coverage and depth of 
genomes were calculated with SAMtools v1.10 based on SAM files from Bowtie2. 
Reference genomes, IVDC-HB-01 (GISAID: EPI ISL 402119) and Wuhan-Hu-1 
(GenBank: NC 045512), were employed as a query. Multiple sequence alignment of 
the seven SARS-CoV-2 sequences obtained from this study and reference sequences 
were performed with Mafft (v7.450). Phylogenetic analyses were performed using 
RAxML v8.2.9 with 1000 bootstrap replicates, employing the GTR nucleotide 


substitution model and the Gamma distribution. 
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Bioinformatic analysis of the species abundances 

The Kraken2 (version 2.1.2)? was used for species classification with the option *-- 
confidence 0.1”. Sequences of all species in the Nucleotide (nt) database were used for 
generating the index. The bracken (version 2.5) was used for re-evaluating species 
abundance. The matrix of species was obtained by using the pavian algorithm'. ggplot2 
package in R was used for plotting. Read counts of each genus were used for further 
analysis and plotting. Raw counts of four domains (Archaea, Viruses, Eukaryota, and 
Bacteria), SARS-CoV-2, Homo genus, and Mammalia class were shown by heatmap 
(4B). Two tail unpaired t-test was used for identification of differential genus between 
SARS-CoV-2 PCR-positive and -negative samples. 

For the analysis of the mammalian genus characterization, the reference was generated 
using the sequence of mitochondrial cytochrome c oxidase subunit I (COI-5P) in the 
barcode of life data (BOLD) system**. RNA-seq samples were mapped to the reference 
sequences by the bowtie2’ algorithm with the default settings. Read counts of each 
genus were calculated by the samtools®. Read counts over 20 were used as cut-off for 
the identification of positively enriched genus. Fisher’s exact test was used for 
comparing the differential genus in the Mammalia class between SARS-CoV-2 PCR- 


positive and -negative samples. 


Ethics 
The sample collection was determined by China CDC to be part of the emergency 
responses to the pneumonia of unknown etiology (PUE) and therefore exempt from 


institutional review board assessment. 
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Extended Data Figure Legends 

Extended Data Fig. 1. The overground drainage pathway in the Huanan Seafood 
Market and environmental sample collection. 

The wastewater in the overground drainage was lead into the underground drainage 
inside the market and then flow into the wells on the edge of the market. And we did a 
spot-check sampling across all the overground drainages. To detect for the presence of 
SARS-CoV-2 RNA, reverse transcription and quantitative polymerase chain reaction 
(RT-qPCR) were performed. The locations of the positive samples were marked in the 
map of the market within yellow. 

Extended Data Fig. 2. Positive environmental samples associated with different 
products in the Huanan Seafood Market. 

Dots represent the percentage of positive environmental samples associated with each 
product. Bars represent 95% confidence intervals for the binomials in the text above. 
Note that the confidence interval (CI) for some products (e.g. vegetables, farmed 
wildlife) have broad error bars that are likely due to the low number of vendors for 
these categories in the market. Nine of the 10 vendors selling farmed wildlife have been 
sampled. 

Extended Data Fig. 3. Illustration of mammal genera distribution in samples of 
concerns. Illustration of mammal genera distribution in samples of concerns. Samples 
related to the blood spot and the de-feather machine (Env_0262 and Env_ 0584) and 
samples enriched with genera related to wildlife (Env_0576, Env_0807, Env_0809, and 
Env_0585) were plotted. Animal genera identified by the BOLD method were shown 
in the left panel, while mammal genera identified by the kraken2 method were shown 
in the right panel. 

Extended Data Fig. 4. Distribution of the positively detected Mammal genera in 
the market. The distribution of SARS-CoV-2 and potential host were plotted by yellow 
and blue dots, respectively. The density of the distribution of potential host was shown 


in red, while the SARS-CoV-2 by green. 
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Supplementary information guide for the SI files 


Supplementary Table 1. The information of the 1380 samples collected from or around 
the Huanan Seafood Market (HSM). Sample ID: Unique ID for each sample. Lab code: 
Temporary IDs used for sampling and laboratory testing. Sampling location: The 
relative location of the sampling site to the HSM, or the relationship of the site to the 
SHM, whether it is inside the HSM (east/west), near the HSM, or other. Type of vendor 
sold product including aquatic, seafood, poultry, livestock, wildlife, vegetable, and 
cold-chain product: Yes, means that the vendor where the sample was collected sells 
this type of product, and No means that the vendor where the sample was collected does 
not sell this type of product. These pieces of information together indicate the scope of 
business of the vendor where the sample was collected. 


Supplementary Table 2. Distribution of different genus in each sample. 


Supplementary Table 3. Classification of genus within the Mammalia class using the 
barcode of life data (BOLD) system. 


Supplementary Table 4. The differential of genus between SARS-CoV-2 PCR-positive 
and -negative environmental samples. 


Supplementary Table 5. Risk of co-existence with SARS-CoV-2 positive samples. 
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739 Extended Data Tables 


740 Extended Data Table 1. Overview of environmental sample sampling and testing in the 
741 Huanan Seafood Market. 


Number of Number of positive Number of 
samples samples by RT-PCR isolated viruses 

Huanan Seafood Market 718 40 3 

Warehouses related to the 14 5 

Huanan Seafood Market* 

Other markets in Wuhan and 

Huanggang? 2d | 

ggang 

Drainage system in the Huanan 110 24 

Seafood Market 

Sewerage wells in surrounding 51 3 

areas 

Total 923 73 3 


742 а The warehouses related to the Huanan Seafood Market were located out of the market. 

743 > The one positive sample outside HSM was collected from Dongxihu Market in Wuhan. More 
744 information was provided in Extended Data Table 4. 

745 


746 
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Extended Data Table 2. The collection logic of the environment samples. 


. 152 Sample 
No. | Time Objective . Amount | Sum 
time 
(1) Environmental samples from stalls related to 
early cases; (2) Environmental samples from doors 
and floors of all stalls in the blocks where the early 
1 1,Jan cases were located; (3) Environmental samples in 1,Jan 515 515 
the east wing of the market were collected 
according to blocks; (4) Transport carts, trash cans 
and similar objects. 
Environmental samples from stalls that sold 
2 12,Jan livestock, poultry, farmed wildlife (also called 12,Jan 70 70 
domesticated wildlife). 
Environmental samples from other markets in 
3 |22Jan 22,Jan 30 30 
Wuhan 
23,Jan 23 
25,Jan 2 
4 23,Jan- | The outdoor environmental samples from stalls that | 3,Feb 16 52 
19,Feb sold livestock, poultry, farmed wildlife. 9, Feb 
15,Feb 4 
19,Feb 
27,Jan 38 
5 27,Jan- | Samples of sewage and silt from drainage channels | 29,Jan 26 Bi 
]5,Feb | and sewerage wells in the market. 9,Feb 9 
15,Feb 21 
6 5,Feb- | Samples of sewage and silt from city sewerage 5,Feb 32 i 
9,Feb wells around the market. 9,Feb 39 
(1) Cold storages and refrigerators from stalls that | 20,Feb 27 
sold livestock, poultry, farmed wildlife in the 22,Feb 12 
7 20,Feb- | market; (2) The market's ventilation and air- 23,Feb 1 gi 
2,Mar conditioning system; (3) Public toilets, public 25,Feb 2 
activity rooms and other places where people 29,Feb 15 
gathered in the market. 2,Mar 34 
Total 923 
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750 Extended Data Table 3. The collection logic of the animal samples. 


А m Sample 
No? | Time Objectives . Amount | Sum 
time 
8 22,Jan Animal products in other markets. 22,Jan 6 6 
25,Jan 55 
20,Feb 23 
Animal products and other 
21,Feb 36 
25,Jan- commodity samples kept in the cold 
9 Å | 23,Feb 5 306 
10,Mar storages and refrigerators in the 
25,Feb 47 
market. 
2,Mar 75 
10,Mar 65 
27,Jan 5 
5,Feb 3 
10 27,Jan- Live animals captured around the 9,Feb 2 m 
1,Mar market. 15,Feb 3 
29,Feb 2 
1,Mar 2 
18,Jan 1 
27,Jan 12 
28,Jan 8 
29,Jan 21 
TT Stray cats, mice, cat feces and other | 5,Feb 10 
,Jan- 
11 stray animals (one dog and one 15,Feb 2 96 
30,Mar 
weasel in the market). 23,Feb 2 
14,Mar 2 
20,Mar 2 
22,Mar 4 
30,Mar 32 
Animal products and other 19,Feb 28 
19,Feb- . . 
12 commodity samples kept in the cold 32 
23,Feb 23,Feb 4 
storages. 
Total 457 


751 “Тһе number follows the upper Table for environment samples. 
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753 Extended Data Table 4. The information of the sampling in other markets. 


Numberof Number of Number of | Number of 


er Е ag Е animal positive animal 
District environment positive environment samples Ё 
samples" by RT-PCR samples samples 
by RT-PCR 

Jiang'han district 7 0 2 0 
Jiang’an district 8 0 2 

Donxihu district 7 1 1 0 
Huanggang city 8 0 1 0 
Total 30 1 6 0 


754 . *Swab sample collected from the floor, wall or chopping board. 
755 The heart, liver and large intestine tissues from pigs. 
756 
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757 Extended Data Table 5. Twenty-one venders of nucleic acid testing (NAT) positive in the 
758 Huanan Seafood Market. 
Product types? 
Cold- Aquatic Seafood DE 
А : : Wildlife 
Vendors No. Location chain product product Poultry Livestock Vegetables 
products S S Pen 
1 West no no no yes no no no 
2 West yes yes yes no no no no 
3 West yes yes no yes yes yes no 
4 East yes no no yes yes no no 
5 West no no no no no no no 
6 West no yes no yes yes no no 
7 West yes no no yes no no no 
8 West yes yes yes yes no no no 
9 West yes yes yes no no no no 
10 West yes yes yes yes yes no no 
11 West yes yes no no no no no 
12 West yes yes yes no no no no 
13 West yes yes no no no no no 
14 West yes yes no no no no no 
15 West yes yes no no no no no 
16 West yes yes no no no no no 
17 West no no no no no no no 
18 West yes no no yes yes no no 
19 West no no no no no no yes 
20 West yes no no no no no yes 
21 East yes yes yes no no no no 
Sum of NAT positive 
16 13 6 8 5 1 2 
Vendors sampled in the 
study selling such 87 73 56 37 36 9 8 


759 
760 


products 


а *ves” indicates product sold by vendors; “по” indicates product not sold by vendors. 
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761 Extended Data Table 6. Information of the high-throughput sequencing data of 

762 environmental samples from Huanan Seafood Market’. 

No. SampleID Lab Code GISAID_Accession SARS-CoV-2 Sample type Street No.: Vendor No. 

b qPCR result 

1 Env 0002 А2 EPI ISL 13052326 Positive Ground 15-17 

2 Env. 0014 A14 EPI ISL 13052323 ТН Surface of packing 4 26 
bag for hairtail 

3 Env 0015 А15 EPI ISL 13052324 Positive Surface of the door 4 26 

4 Env. 0018 A18 EPI ISL 13052325 Positive Shoe covers and soles 7 15-17 

5 Env 0020 А20 EPI ISL 13052327 Positive Gloves 7 15-17 

6 Env. 0033 A33 EPI ISL 13052328 Positive Garbage cart 7 NA 

7 Env 0055 А55 ЕРІ ІСІ, 13052329 Positive Ground 7 25 

8 Env 0061 — A61 EPI ISL 13052330 Positive Ground 7 20-22-24 

9 Env 0063 A63 EPI ISL 13052331 Positive Ground 7 16-18 

10 Env. 0087 А87 EPI ISL 13052332 Positive Surface of the door 4 28 

11 Env 0088 A88 EPI ISL 13052333 Positive Ground 4 28 

12 Env 0090 А90 EPI ISL 13052334 Positive Ground 4 26 

13 Env 0096 А96 EPI ISL 13052335 Positive Ground 4 24 

14 Env. 0101 A101 EPI ISL 13052322 Positive Surface of the door 4 19 

15 Env 0126 B5 EPI ISL 13052337 Positive Ground 5 6-8 

16 | Env 0138 B17 EPI ISL 13052336 Positive Scale 15 X44 

17 Env. 0213 D32 EPI ISL 13052338 Positive Surface of a cart 15 15 

18 Епу 0221 ЕТ? EPI ISL 13052341 ie Styrofoam desk in 2 5 
front of the door 

19 Env. 0262 E48 EPI ISL 13052339 — Blood on the ground 9 22 
in front of the door 

20 Env 0275 E61 EPI ISL 13052340 Positive Ground 6 1-3 

21 Env. 0313 F13 EPI ISL 13052343 Positive Surface of the wall 11 15 

22 Env 0346 F46 EPI ISL 13052344 Positive Ground 2 24 

23 Env 0354  F54 EPI ISL 13052345 Positive Ground 2 14 

24 Env 0398 F98 EPI ISL 13052346 Positive Ground 4 X6-X4 

25 Env 0400 F100 EPI ISL 13052342 Positive Ground 4 X6-X4 

26 Env. 0552 Q37 EPI ISL 13052297 RM Inner surface of the 8 25 
freezer 

27 Епу 0576  Q61 EPI ISL 13052298 Positive Cart1 6 29-31-33 

28 | Env 0579 064 EPI ISL 13052299 Positive Cart2 6 29-31-33 

29 Env 0583  Q68 EPI ISL 13052300 Positive Ground 6 29-31-33 

30 Env 0584 069 EPI ISL 13052301 — Feather removal 6 29-31-33 
machine 

3l Env. 0585 Q70 EPI ISL 13052302 БЕТ Iron container іп inner 6 29-31-33 
room 

32 Env 0660 1-27-33 EPI ISL 17064533 Positive Water drain 6 29-33 

33 Env. 0664 1-27-37 EPI ISL 17064534 Positive Water drain 10 4 

34 Env 0682 1-29-4 EPI ISL 17064535 Positive Water drain 4 5-7-9 
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Env 0686 
Env 0717 


Env 0719 


Env 0742 
Env 0746 
Env 0748 
Env 0752 
Env 0790 
Env 0802 
Env 0806 


Env 0807 


Env 0808 
Env 0809 
Env 0813 
Env 0815 
Env 0816 
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139 Env 0715 8-25-р EPI ISL 17064621 Negative Knief 8 25 
140 Env 0716 8-25-CK EPI ISL 17064622 Negative Sewage 8 25 
141 Env 0720  825-M2 EPI ISL 17064623 Negative Ground 8 25 
142 Env 0749 WS-3-1 EPI ISL 17064624 Negative Sewage well NA NA 
143 Env 0751  WS-4-1 EPI ISL 17064625 Negative Sewage well NA NA 
144 Env 0753  8-25-DI EPI ISL 17064626 Negative Ground 8 25 
145 Env 0754 8-25-О2 ЕРІ ISL 17064627 Negative Ground 8 25 
146 Env 0756 8-25-Long EPI ISL 17064628 Negative Container 8 25 
147 Env 0759 Outside-5 EPI ISL 17064629 Negative Water drain 5 NA 
148 Env 0768 НХЈМНІ-1- ЕРІ ISL 17064630 Ke Sewage well NA NA 
2 
149 Env 0770  JXRJ-1-1 EPI ISL 17064631 Negative Sewage well NA NA 
150 Env 0788 CSSQ-1-1 EPI ISL 17064632 Negative Sewage well NA NA 
151 Env 0789 = CSSQ-1-2 EPI ISL 17064633 Negative Sewage well NA NA 
152 Env 0798  YCHC2-1-2 ЕРІ 151, 17064634 Negative Sewage well NA NA 
153 Env 0799  YCHC2-1-3 ЕРІ ISL 17064635 Negative Sewage well NA NA 
154 Епу 0800  HXJCA4-1-1 EPI ISL 17064636 Negative Sewage well NA NA 
155 Env 0800 НХЈС4-1-2 ЕРІ ISL 17064637 Negative Sewage well NA NA 
156 Env 0839 C9 EPI ISL 17064638 Negative Ground 5 NA 
157 Env 0842 C12 ЕРІ ISL 17064639 Negative Wall 3 NA 
158 Env 0847 C17 EPI_ISL_17064640 Negative Surface of container 6 34 
159 Env 0856 Е-10-29-2 EPI ISL 17064641 Negative Container 10 29 
160 Env 0858 E-A-7-1 EPI ISL 17064642 Кайа Container Accessory 7 
Street 

161 Епу 0873  CCDC-454 EPI ISL 17064643 Negative Container 9 34 
162 Env 0875 629-1-L1 EPI ISL 17064644 Negative Container 6 29 
163 Env 0877 629-3-С EPI ISL 17064645 Negative Cart 6 29 
164 Env 0879 629-5-L4 EPI ISL 17064646 Negative Container 6 29 
165 Env 0882 629-8-L7 EPI ISL 17064647 Negative Container 6 29 
166 Env 0885 629-11-L EPI ISL 17064648 Negative Container 6 29 
167 Env 0887 629-13-L EPI ISL 17064649 Negative Container 6 29 
168 Env 0889 629-L-1 EPI ISL 17064650 Negative Container 6 29 
169 Env 0893  CCDC-744 EPI ISL 17064651 Negative Ground 6 29 
170 Env 0895 CCDC-746 EPI ISL 17064652 Negative Container 6 29 
171 Env 0907  CCDC-758 ЕРІ ISL 17064653 Negative Container 8 25 
172 Env 0909 CCDC-760 ЕРІ ISL 17064654 Negative Ground 8 25 


763  *Allthe raw sequencing data have been uploaded onto the GISAID (China CDC Weekly, 2021, 
764 DOI: 10.46234/ccdcw2021.255). The raw sequence data reported in this paper have also been 
765 deposited in the Genome Sequence Archive (Genomics, Proteomics & Bioinformatics, 2021, 
766 DOI: 10.1016/j.gpb.2021.08.001) in National Genomics Data Center (Nucleic Acids Res, 
767 2022, DOI: 10.1093/nar/gkab951), China National Center for Bioinformation / Beijing Institute 
768 of Genomics, Chinese Academy of Sciences that are publicly accessible at 
769  https://ngdc.cncb.ac.cn/gsa (GSA: CRA010170). Raw sequence data was deposited into NCBI 
770 ВіорРгојесі under accession number PRJNA948658 
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771 
772 
7/3 
774 
775 
776 
777 
778 
779 


(http://www.ncbi.nlm.nih.gov/bioproject/948658) and іп China National Microbiology Data 
Center (NMDC) with accession numbers NMDC10018366 
(https://nmdc.cn/resource/genomics/sample/detail/NMDC 10018366). 

> Three original samples (Env_0020, Env 0313, and Env 0354) were sequenced twice, for 
metagenomic analysis (as shown in this table) and viral whole-genome assembly (raw data was 
shown in Extend Data Table 7), respectively. 

* The street or vendor information of some samples was not applicable, such as transport carts, 
trash cans, sewage and silt from drainage channels and sewerage wells. 
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780 Extended Data Table 7. Information of the three SARS-CoV-2 sequences from 
781 environmental samples and the four sequence from cell supernatants’. 
А Depth 
Sample Lab GISAID Accession Collection Mapped TA 
Name — А Length mean+S 
ID code Genome Raw data? date reads D) 
hCoV- 
Env_0020 EPI_ISL EPI_ISL 2902.81 
A20 19/env/Wuhan/IV 2020/1/1 29368 897978 
_seq01 10497477 17064655 924.9 
DC-HBA20/2020 
hCoV- 
Env 0313 EPI ISL EPI ISL 611.5427 
F13 19/env/Wuhan/IV 2020/1/1 28557 187396 
_seq02 408511 17064656 53 
DC-HBF13/2020 
hCoV- 
Env 0354 EPI ISL EPI ISL 170.8+78. 
F54  19/env/Wuhan/IV 2020/1/1 29820 52348 
_seq03 408512 17064657 6 
DC-HBF54/2020 
hCoV- 
Env 0313 F13- 19/env/Wuhan/IV ЕРІ ISL EPI ISL 1117.6+9 
2020/1/20 29820 225356 
_seq04 20 DC-HBF13- 408514 17064658 34.6 
20/2020 
hCoV- 
Env 0313 F13- 19/env/Wuhan/IV ЕРІ ISL EPI ISL 1474.7+1 
2020/1/21 29820 301488 
_seq05 21 DC-HBF13- 408515 17064659 362.8 
21/2020 
hCoV- 29865 1506094 3708.0- 
Env 0126  B5- 19/env/Wuhan/IV ЕРІ 151, EPI ISL 1150.8 
2020/2/29 
_seq06 P3 DC-HBB5- 10497479 17064660 
P3/2020 
hCoV- 29864 1442240 3553.0+ 
Env 0354 F54- 19/env/Wuhan/IV ЕРІ ISL EPI ISL 1143.0 
2020/2/29 
_seq07 P3 DC-HBF54- 10497481 17064661 
P3/2020 
782  * All the raw sequencing data (different from the Extended Data Table 6) and 7 genome 
783 | sequences have been uploaded onto the GISAID (China CDC Weekly, 2021, DOI: 
784 . 10.46234/ccdcw2021.255). The raw sequence data reported in this paper have also been 
785 deposited in the Genome Sequence Archive (Genomics, Proteomics & Bioinformatics, 2021, 
786 DOI: 10.1016/j.gpb.2021.08.001) in National Genomics Data Center (Nucleic Acids Res, 
787 2022, DOI: 10.1093/nar/gkab951), China National Center for Bioinformation / Beijing Institute 
788 of Genomics, Chinese Academy of Sciences that are publicly accessible at 
789 https://ngde.cncb.ac.cn/gsa (GSA: СКА010170). The viral genomes reported in this paper 
790 have been deposited in the GenBase in National Genomics Data Center (Beijing Institute of 
791 Genomics, Chinese Academy of Sciences/China National Center for Bioinformation under 
792 accession number C AA002295.] to C AA002301.1 that is publicly accessible at 
793  nttps://ngdc.cnc.ac.cn/genbase. Raw sequence data was deposited into NCBI BioProject under 
794 accession number PRINA948658 (http://www.ncbi.nlm.nih.gov/bioproject/948658) and in 
795 China National Microbiology Data Center (NMDC) with accession numbers NMDC10018366 
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797 
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799 
800 
801 
802 
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804 
805 
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808 
809 


(https://nmdc.cn/resource/genomics/sample/detail/NMDC1 0018366). 

> Three original samples (Env 0020, Env 0313, and Env 0354) were sequenced twice, for 
metagenomic analysis (raw data was shown in Extend Data Table 6) and viral whole-genome 
assembly (as shown in this table: Env 0020 seq01, Env 0313 seq02, and Env 0354 seq03), 
respectively. 

* The collection dates for the three SARS-CoV-2 sequences (Env 0020 seq01, 
Env 0313 seq02, and Env 0354 seq03) were the original environmental sample collection 
dates. And the collection. dates for the other four sequences (Env 0313 seq04, 
Env 0313 seq05, Env 0126 seq06, Env 0354 seq07) were the date for the sequencing of the 
cell supernatants. 

d The “original” means the sequencing results of the original environmental samples. The P1 
and P3 mean the sequencing results of the cell supernatants from passage 1 and 3, respectively, 
during the virus isolation. 
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810 Extended Data Table 8. Summary of number of reads mapped to positions 8782 and 28144 in 
811 different samples”. 
Position 
Sample ID 2. 8782 28144 
A T C G A T C G 

Env 0101 А101 0 0 0 0 0 0 0 0 
Env. 0014 А14 0 0 0 0 0 0 0 0 
Епу_0015 А15 0 0 0 0 0 0 0 0 
Епу_0018 А18 0 0 0 0 0 0 0 0 
Епу_0020 А20 20 1044 24 0 3 19 1343 5 
Епу_0002 A2 0 0 0 0 0 0 0 0 
Env 0033 A33 0 0 2 0 0 0 1 0 
Env 0055 A55 0 0 0 0 0 0 0 0 
Env 0061 A61 0 0 2 0 0 0 1 0 
Env 0063 A63 0 0 0 0 0 0 0 0 
Env 0087 A87 0 0 0 0 0 1 0 0 
Env 0088 A88 0 0 0 0 0 2 0 0 
Env 0090 A90 0 0 0 0 0 0 0 0 
Env 0096 A96 0 0 0 0 0 0 0 0 
Env 0138 B17 0 0 0 0 0 0 0 0 
Env 0126 B5 0 0 86 0 0 85 0 0 
Env_0213 D32 0 0 0 0 0 0 0 0 
Env 0262 E48 0 0 0 0 0 0 0 0 
Env_0275 E61 0 0 0 0 0 0 0 0 
Env 0221 E7 0 0 0 0 0 0 0 0 
Env 0400 Е100 0 0 0 0 0 0 0 0 
Env. 0313 F13 6 13 403 4 11 644 6 6 
Env. 0333 F33 0 0 0 0 0 0 0 0 
Env. 0346 F46 0 0 1 0 0 0 0 0 
Env 0354 F54 4 0 99 0 3 149 3 1 
Env 0398 F98 0 0 20 0 0 3 0 0 
Env 0509 G93 0 0 0 0 0 0 0 0 

812 ° These data are based on raw data in Extended Data Table 6. 

813 
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